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T-H Abstract. Let M„ be a random matrix of size nxn and let Ai, . . . , An be the eigenvalues 

^^ of Mn- The empirical spectral distribution fiM„ of M„ is defined as 



1 



H ^^M„ {s, t) = ^#{fc < n, 3ff(Afc) < s; 9(Afc) < t}. 

^H The circular law theorem in random matrix theory asserts that if the entries of M„ 

^H are i.i.d. copies of a random variable with mean zero and variance a^ , then the empirical 

r~-«» spectral distribution of the normalized matrix -;^y^M„ of Mn converges almost surely to 

CNj the uniform distribution /icir over the unit disk as n tends to infinity. 

I I In this paper we show that the empirical spectral distribution of the normalized matrix 

^^ of Mn, a random matrix whose rows are independent random (—1,1) vectors of given 

r ) row-sum s with some fixed integer s satisfying \s\ < (1 — o(l))n, also obeys the circular 

• law. The key ingredient is a new polynomial estimate on the least singular value of M„. 

I ^1 1. Introduction 

^ Let Mn be a matrix of size nxn and let Ai, . . . , A^ be the eigenvalues of M„. Then the 

^ empirical spectral distribution (ESD) fj,M„ of M^ is defined as 

On 

^ 1 

cn mnis,t) = -#{k < n,K(Afe) < s;9(Afc) < t}. 

O "- 

(N 

^^ We also define //cir cis the uniform distribution over the unit disk, 



1 



X 

H Hcir{s,t) = -mes{\z\ < l;^{z) < s,Q{z) < t). 

Confirming a long standing conjecture in random matrix theory, a recent result of Tao and 
Vu (appendix by Krishnapur) proves a universal law for the ESD of random i.i.d. matrices. 



Theorem 1.1. |31J Assume that the entries of Mn are i.i.d. copies of a complex random 
variable of mean zero and finite non-zero variant 
converges to ficir almost surely as n tends to (X). 



variable of mean zero and finite non-zero variance a"^, then the ESD of the matrix — ^M„ 



The proof of this result is built upon previous important developments of Girko [9l [10] , Bai 
[1], Gotze-Tikhomirov [11], Pan-Zhou |20j . Tao-Vu [27] and many others. 
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In view of universality phenomenon, it is of importance to study the law for random matrices 
of non-independent entries. Probably one of the first results in this direction is due to 
Bordenave, Caputo and Chafai [3] who prove the law for random Markov matrices. 

Theorem 1.2. [3, Theorem 1.3] Let X be a random matrix of size n x n whose entries 
are i.i.d. copies of a non-negative continuous random variable with finite variance a^ and 
bounded density function. Then with probability one the ESD of the normalized matrix 
y/nX , where X = {xij)i<ij<n and Xij := Xij/{xii + ••• + Xin), converges weakly to the 
circular measure /Ucir- 



In particular, when xn follows the exponential law of mean one. Theorem 1.2 establishes 



the circular law for the Dirichlet Markov ensemble (see also ^). We remark that the 



assumptions of continuity and boundedness are crucial in the proof of Theorem 1.2 



Related results with "linear" assumption of independence include a result of Tao, who 
among other things proves the circular law for random zero-sum matrices. 

Theorem 1.3. [24^, Theorem 1.13] Let X be a random matrix of size n x n whose entries 
are i.i.d. copies of a random variable of mean zero and variance one. Then the ESD of 
the normalized matrix -y^X , where X = {xij)i<ij<n and Xij := xij — ^{xn + • • • + Xm), 
converges almost surely to the circular measure ^cir- 

The main goal of this note is to showing that the circular law also holds for random discrete 
matrices of similar weak constraints. 

Theorem 1.4 (Main result). Let < e < 1 be a positive constant. Let Mn be a random 
(—1,1) matrix of size nxn whose rows are independent vectors of given row-sum s with 
some s satisfying \s\ < (1 — e)n. Then the ESD of the normalized matrix — y=Mn, where 

o"^ = 1 — (-)^, converges almost surely to the distribution ficir as n tends to oo. 

To some extent, our matrix is a discrete version of the random Markov matrices considered 



in Theorem 1.2 where the entries are restricted to itl/s. However, it is probably more 
suitable to compare our model with that of random Bernoulli matrices. By Theorem |1.1[ 
the ESD of the normalized random Bernoulli matrices obeys the circular law, and hence 
our Theorem 11.41 serves as a local version of the law. 



We remark that in a very recent result |T8j, the first author is able to prove a similar 
law for random doubly stochastic matrices, thus confirming the universality principle for 
another type of matrix of independent entries. Although the results are similar in spirit, 
the difficulties in each note are very different. The main obstacle of this note is to study the 
singularity of M„ and its perturbed variants. Inverse techniques developed in the literature 
to deal with this problem do not seem to suffice. This leads us to a new development to 
be discussed in Section ^ Note that our approach may also cover the regime n — s = o{n) 
but we do not attempt to do so here. In what follows we present some reduction steps to 
simplify our problem. 

Observe that, by letting X„_i be the submatrix generated by the first n — \ rows and 
columns of M„, the spectra of M„ is the union of s and the spectra of the pertubed matrix 
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Figure 1. The ESD of a random matrix of size 1000 by 1000 whose rows 
are (—1,1) vectors of zero-sum, picture by Philhp Woods. 

Xn-i — Fn-i where ah of the rows of F„,-i are identical copies of (r?i„,i, . . . , ?7in(n-i))) here 
by rriij we mean the ij-th entry of M„. 

Indeed, consider the matrix M := M„ — A/„. We have 



det(M) = det(M'), 

where M' is obtained from M by adding its first n — 1 columns to its last one. 
On the other hand, we also have 



det(M') = (s - A) det(M") 



where 



M" :-- 



(mil - A 

"^(n-l)l 

V runi 



"^l{n-l) 



m 



(„_i)(„_i) 

"T'n(n-l) 



1\ 



A 1 
1/ 



It is clear that det(Af") = det(M"'), where M'" := (X„_i-F„_i)-A/„_i. Thus the spectra 
of Mn is indeed the union of s and the spectra of the pertubed matrix X„_i — Fn-i- 



The observation above suggests a way to prove Theorem |1.4| by looking at the ESD of 
Xn~i — Fn-i- This alternative helps us avoid the outlier eigenvalue s of M„ which may 
cause certain technical difficulty for any direct study on M^. 
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Notice that the rows of Xn-i above are independent vectors chosen uniformly from the set 



of all (—1,1) vectors of row-sum either s — 1 or s + 1. So for Theorem 1.4 it suffices to show 
the following. 

Theorem 1.5 (Circular law for pertubed matrices). Let X^ be a random. (—1,1) matrix 
whose rows are independent random vectors of row-sum either s — 1 or s + 1 with given s 
satisfying \s\ < (1 — e)n. Let Fn be a deterministic matrix whose rows are identical copies 
of a given (—1, 1) vector f. Then the ESD of ^^A^(X„ + F„), where a^ = 1 — (f )^, converges 
almost surely to the distribution of fi^ir «s n tends to oo. 

For short, by S we denote the set of all (—1, 1) vectors x = (xi, . . . , x„) of row-sum either 
s — 1 or s -|- 1. To establish Theorem 1 1. 5 1 we will relate Xn to a random matrix X!^ whose 
entries are i.i.d. copies of a random Bernoulli variable x of the following form 



{P(X — 1) — 2 2n' /'1^ 

P(^ = l) = 5 + t- ^' 

It is known that the ESD of — ^(^^ + F'n) converges uniformly to ^cir (see for instance 
|31] Corollary 1.15]). As we desire to pass this result to Xn + Fn, we will make use of a so 
called replacement principle below. 

Theorem 1.6. |3H Theorem 2.1] Suppose for each n that An = (ojj), Bn = {bij) are random 
matrices of size n x n. Assume that 



the sum 



-^ J^(|ai,f + |6i '2^ 



n^ '--' 



is bounded almost surely; 

for almost all complex numbers z 



- log I det(— =^„ - zLn)\ log I det{—=Bn - zLn) 

n \/n n \ln 



converges almost surely to zero. 
Then /i j_^ — fJ'J-B converges almost surely to zero. 

In application, Xn + Fn plays the role of An and X^ + Fn plays that of Bn- It is clear that 



the first condition of Theorem 1.6 is satisfied. Thus for Theorem 1.5 it suffices to justify 
the second condition. 

Theorem 1.7. For every fixed complex z we have 



- log I det((X„ + Fn) - zV^Ln)\ - - log | det((X; + F„) - z^/^In 
n n 
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converges to zero almost surely. 



We will outline a proof for Theorem 1.7 in the next section 



Notation. Here and later, asymptotic notations such as O, ft, B, and so for, are used under 
the assumption that n — )• oo. A notation such as Oc{-) emphasizes that the hidden constant 
in O depends on C. 

For 1 < s < n, we denote by Bs the unit vector (0, . . . , 0, 1, 0, . . . , 0), where all but the s-th 
component are zero. For a real or complex vector v = (vi, . . . ,Vn), we use the shorthand 
||v|| for its L2-norm (^^ |wjp)^'^- 

For a matrix M, we use the notation rj(M) and Cj(M) to denote its i-th row and j'-th 
column respectively. For an event A, we use the subscript Px(vl) to emphasize that the 
probability under consideration is taking according to the random vector x. 

2. Proof of Theorem II. 7t outline 

Let fi, . . . , fn denote the (deterministic) rows of F„ + y^z/^, and let xi, . . . ,x„ as well as 
x'^ , . . . , x'„ be the rows of X„ and X'^ respectively. 

For each i > 2, let Vi-i be the space spanned by xi + fi, . . . , Xj_i + fj_i and let dist(xj + 
fj, Vi-i) be the distance from Xj + fj to Vi-i. Define similarly for V^_i and dist(x'^ + fi, Vl_i). 
By the "base times height" formula we have 



log I det((X„ + Fn) - zVnIn)\ = J];iogdist((xi + f^), Vi-i] 

i 
= Y^ logdist((Xi + fi), Vi-l) + Y^ logdist((x, + fi), V^^l) 

:=logS'i + log5'2; 
and similarly, 

log I det((X; + Fn) - z^/^In)\ = J^logdist((x'i + fi), Vi^i] 

i 

= Y logdist((x^ + f.), VU) + ^ logdist((x: + fi), y/_i) 

i<m m<i 

:=logS'i + logS'2. 
where we set the threshold ?n, to be tti := n — log n. 



In order to compare log 
will show the following. 



det((X„ + Fn) - zy/nin) with log det((X; + Fn) - Zy/nl, 



we 
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Theorem 2.1. With probability 1 — exp(— log ~°' ' n) we have 



-I log Si -log 5; I = 0(log-2n). 
n 

Theorem 2.2. With probability 1 — 0{n^^^^) we have 

\\\ogS2\ + \\o^S'^\) = 0{\og'n/n). 
n 



It is clear that Theorem 11.71 follows from Theorem 12.11 and Theorem 12.21 In what follows 
we outline the approach to prove these results. 



2.3. Sketch of the proof of Theorem 2.1 , One of the main ingredients is the following 
row replacement principle. 

Lemma 2.4. Let i be an integer between 1 and m. Let xi, . . . ,Xj,x^,xJ_)_^, . . . ,x^ be m + 1 
independent vectors where the Xj 's are random vectors of type S and x^ 's are random vectors 
whose components are i.i.d copies of x from M. Assume that volj is the m-dimensional 
volume of the parallelepiped generated 6y xi + fi, . . . , Xj + fj, x'^_|_-^ + fj+i, . . . , x^ + f^ and 
voli_i is that of the parallelepiped generated by xi + fi, ... , Xj_i + fj_i, x'^ + fj, . . . , x^ + f^. 
Then we have 



Pxi,...,x„x;,x;;^,,...,x;, ( I logvoli - l0gV0li_l| = 0(log ^ ^) j = 1 - exp(- log^ "^^^ u) . 



Lemma 2.1 then follows by a repeatedly use of Lemma 2.4 and the triangle inequality using 
the fact that ^i and S{ are volumes of the parallelepipeds generated by xi + fi , . . . , x^ + fm 
and by x'^ + fi , . . . , x'^ + f^ respectively. 



We now justify Lemma 2.4 We express volj as volj = d x vol, where d is the distance from 
Xj + fj to the space V spanned by xi + fi, . . . , Xj_i + fj_i, x-_,_^ + f-^-^^, . . . , x'^ + f^, and vol 
is the volume of the parallelepiped generated by these vectors. Similarly we can express 
volj_i as volj_i = d' X vol, where d' is the distance from x^ + fj to V. 

Thus we have 

I log(volj) - log(voli_i)| = I logd - logd'|. 

We will next see that d and d' are almost identical with very high probability. 

Let f be a fixed vector (whose coordinates may depend on n). In what follows we denote 
the translation f + (s/n, . . . , s/n) of f by f . 

Lemma 2.5. Assume that V C C" is a subspace of dimension dim(y) = k < n — 10. Let 

x' = [x'l, . . . ,x'„) be a random vector where x[ are i.i.d. copies of x from ([I]) and let d' be 
the distance from x' + f to V . Then for any t > we have 
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Px'd^' - \n-k + dl\>t + 'i)< exp(- 



where df is the distance from f to V . 



Lemma |2.5| can be proved by using a well-known result of Talagrand; we defer its proof to 
Section [3 



As E(^^x'j) = s and Var(^^ j;^) = 6(n), the probability that a random vector x' belongs 
to the set of (—1, 1) vectors of row-sum s + 1 (or s — 1) is Q{l/^/n). Furthermore, condition 
on x' £ S, x' is uniformly distributed over these sets. We thus infer from Lemma 2.5 the 
following. 

Corollary 2.6. Let x be a vector uniformly sampled from S and let d be the distance from 
X + f to V . Then for any t > we have 



Px(|d- 



n 



k + dl\>t + 3) = 0{^/nexp{- — )). 



One immediate consequence of Lemma 



2.5 



2.6 



is that if k < n — log n, then 



and Corollary 

by setting t = logn, d is nonzero with probability at least 1 — 0(exp(— log ~°^ n)). By 
applying this fact m times, we conclude that all the volj are non-zero with probability at 
least 1 — 0(exp(— log ^°^ ' n). So it is safe to assume that V has dimension exactly m — 1 

,x^ + fm. Next, by applying 



for any V spanned by xi + fi , 
Lemma 



2.5 



respect to Xj and x' we have 



|Xi_l + fi_l,Xj^]^ + fj_,_]^, 



and Corollary 2.6 once more, with probability 1 — 0(exp(— log °^ n)) with 



\d- 



n — m + \ + dii\ < log n 



and 



d' — ^ n — 7TT- + 1 + d?, I < log n. 



It then follows that 



logd — logd'l < log(l + 



21ogn 



log n — log n 



0(log ^n), 



completing the proof of Lemma 2.4 
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2.7. Sketch of the proof of Theorem 2.2[ Our key lemma here is to showing that the 
least singular value of X^ + F„ + z^/nln, for any fixed complex number z, is at least n~^^^' 
with probability 1 — 0{n~^^^). 

Theorem 2.8. Assume that F is a deterministic complex matrix of size n x n such that 
\fij\ < n^ for some constant 7. Then for any B > there exists A > depending on B and 
7 such that 



V{an[Xn + F)<n-^) <0{ 



n 



-B\ 



This theorem is an analog of the Bernoulli counterpart X'^ + F whose proof can be found 
in either {32j or in other papers of the second author with Tao such as [301 ISTj |28] . Unfor- 



tunately, these proofs do not seem to cover Theorem 3.1 in any trivial way. Henceforth a 



large part of this note will be devoted to prove it, starting from Section [3| 

We next invoke the following two linear algebra results. 

Lemma 2.9 (Cauchy's interlacing law). [B.1,, Lemma A.l] Let A he a matrix of size n x n 
and A' be the suhmatrix formed by the first n — k rows of A. Let cri{A) > • • • > a'n{A) > 
be the singular values of A, and similarly for A' . Then we have 



(Ti{A) > ai{A') > ai+k{A) 



for every 1 < i < n — k. 

Lemma 2.10 (Negative second moment). \31\ Lemma A. 4] Let 1 < n' < n, and let A' be 

a full rank matrix of size n' by n with singular values ai{A') > ■ ■ ■ > crn{A') > and rows 
ri, . . . , r„' G C". For each I < i < n' , let Wi be the subspace generated by the n' — 1 rows 



ri, 



Ti-i, rj+i, 



, r„/ . Then we have 



E-^\A') 



i=l 



J]dist-2(ri,t^,; 



We now prove Theorem |2.2[ By Theorem 2.8 we can assume that xi + f, . . . , x„ + f spans 
the whole space R" with probability at least 1 — 0{n~^^^), and so in particular all the 
Vi have full rank. Applying Lemma 2.10 for the matrix A' generated by the first k rows 
xi + f , . . . , Xfc + f with any k > m = n — log n, we obtain the following with probability at 
least 1 - O(n-i°0) 



dist-2(xfc + f , Vk-i) < Y, ^r'(^') = 0{n^^' 



i=l 



where in the RHS estimate we applied Lemma 2.9 and then Theorem 2 A 
Thus for any k > m 
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0(n-0(i)) ^ ^.g^(^^ ^ f^ y^_^-^ < 11^^ ^ f II ^ Q(^)_ (2) 



Similarly, by applying the known variant of Theorem 2.8 for (X^ + F„) — Z\/nln and by 
Lemmas 12.91 and 12.101 we also have 



0(n-«(i)) = dist(xl + f, VU) = O(V^). (3) 

Owing to the estimates ([2]) and ([3]), we infer that 



'(^dlog^sl + I log 5^1) = 0(log9n/n)) = 1 - 0(r 



proving Lemma 2.2 



3. The least singular value bound 



For the reader's convenience, we restate Theorem 12.81 below. 



Theorem 3.1. Assume that F is a deterministic com,plex m.atrix such that \fij\ < n^ for 
some constant 7. Then for any B > there exists ^4 > depending on B and 7 such that 



-A\ / r^t^-B^ 



P a„(X„ + F) < n-^) <0{n 



This section is devoted to provide an overview of our approach to prove Theorem 3.1 More 
details of the proofs will be discussed in subsequent sections. 



We use the shorthand X for the matrix Xn + F. To prove Theorem 3.1, we assume that 
there exist vectors a and b in C" such that 1 1 all = 1, ||b|| < n~ and 



Xa = b. 

We next consider two cases. 

Case 1. X is non-singular. Let C{X) = {cij{X)), 1 < i,j < n, be the matrix of the 
cofactors of X. We then have 

C{X)h = det(X) • a. 
Thus 



||C(X)b|| = |det(X)|. 
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By paying a factor of n in probability, without loss of generality we can assume that 

\cn{X)bi + ... cin{X)bn\ > I det(X)|/ni/2. 
Note that ||b|| < n^ , thus by Cauchy-Schwarz inequality 



^|cH(X)p>n2^-idet(X)2. (4) 

i=l 

We next express det(X) as a linear form of its first row ri(X) = (xi + /n, . . . , x^ + fin) 
det(Q) = xiCii{X) -\ (- XnCin{X) + riiCii(X) H h ri„ci„(X). 



Thus, with c := i/X^j cij(X)2 (which is 7^ as (cn, . . . , ci„) / 0), Q can be rewritten as 



< n-^+1/2. 



2;^^llW + . . . + ^^^l!iW + l(/^^cn W + • • • + /inCi„(Q)) 
c c c 



Roughly speaking, our approach to prove Theorem 3.1 consists of two main steps. 

• Step 1. Condition on X' , the matrix of the last n — 1 rows of X, if 

supP2;i,...,x„(| > Xj v\<n ^) > n '^, 

then there is a strong structure among the cofactors cu. 

• Step 2. The probability, with respect to X' , that there is a strong additive structure 
among the cu is negligible. 

We pause to discuss the structure mentioned in the inverse step. A set Q C C is a GAP of 
rank r if it can be expressed as in the form 

Q = {50 + ^151 + • • • + krQrlki £ "ZijKi < ki < K[ for all 1 < f < r} 

for some {go, . . . , <7r) G C^+' and (i^i, . . .,Kr), {K[, . . . , K',) G Z^ 

It is convenient to think of Q as the image of an integer box B := {{ki, . . . , kr) G 'Z/\Ki < 
ki < K'^\ under the linear map <I> : (A;i, . . . , k^) 1— ?• 50 + ^i^i + • • • + k^Qr- 

The numbers gi are the generators of Q, the numbers K'^ and Ki are the dimensions of 
Q, and yo\{Q) := \B\ is the size of B. We say that Q is proper if this map is one to one, 
or equivalently if \Q\ = vol(Q). For non-proper GAPs, we of course have \Q\ < vol{Q). If 
—Ki = K'- for alH > 1 and go = 0, we say that Q is symmetric. 

We are now ready to state our steps in details. 
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Theorem 3.2 (Step 1). Let < a < 1/2 he a given constant. Assume that 

n 
Pn-A{{vi, . . .,Vn}) ■■= SWpPxi,...,x„{\'^XiVi - v\ < n~^) > n~^ 

1=1 

for some sufficiently large A, where Vi = cii{X)/c. Then, there exists a vector u = 
(ui, . . . ,Un) and a real number /3 of the form ji^^+'^iP^+^+'y) where < k < A/{10B + 
10 + 27), k £ Zi such that the following holds. 

• ||u|| X 1 and \{u,ri{X))\ < I3n^ '^'^^^ for n — 1 rows rj of X. 

• There exists a generalized arithmetic progression Q* of rank Oq,s(1) and size \Q*\ = 
max ( 1, 0o,b((/0o^5s+4+7({^1; • • • j^n}))"^/"-" ) ) which contains at least n—n^''^^"' 
complex numbers Ui. 

• All the components of Ui and of the generators of Q* are rational numbers of the 
formp/q, where \p\,\q\ < n ^^ . 



Roughly speaking, the quantity {p*r,g_^_4^^ ({""ij • • • ; ^n})) ^ appearing in the bound of |Q* | 
guarantees that the containment is economical. 

In the second step of the approach, we show that the probability for Q' having the above 
properties is negligible. 

Theorem 3.3 (Step 2). With respect to X' , the probability that there exists a vector u and 



a number j3 as in Theorem 3.2 is exp(— J7(n)) 



We remark here that the choice of a being near 1/2 would optimize the probability bound 



in Theorem 3.3 However, we prefer to keep a abstract to demonstrate the flexibility of our 
approach. 

We now study the remaining case. 

Case 2. X is singular. We show that the probability of this event is bounded by 0{n^^) 
for any B > 0, where the implied constant depends on B. The approach is identical (if not 
easier) to that of Case 1. 

First of all, by paying a factor of n in probability and without loss of generality, it suffices 
to consider the event that xi + fi belongs to the subspace generated by X2 + £2, • . . , x„ + f„. 
We show 

Theorem 3.4. Assume that X„ is a random matrix whose rows xi, . . . , x„ are independent 
random vectors sampled uniformly from S. Then for any B > 

P(xi + fi belongs to the subspace H generated 6y X2 + f2, . . . ,x„ + f„) = 0{n'' ), 
where the implied constant depends on B. 
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Condition on X2, . . . ,x„, let v = {vi, . . . ,Vn) be a unit vector which is orthogonal to H. 
Then the probability that xi + fi = (xi + /n, . . . , x„ + fin) belongs to H is bounded by 
Pxi,...,xr^{xivi -\ h XnVn + {fuVi H h finVn) = 0), and SO crudely by 



P(xi +ii & H) < supPa;i,..,,a;„(|xii;i H \-XnVn - v\ < n ). 



We again apply Theorem |3.2| to obtain a structural vector u, and then use Theorem |3.3| to 
conclude that the probability for the existence of such u is negligible, completing the proof 
of Theorem 13.41 



The rest of the paper is organized as follows. In Section [4] we introduce our key lemmas. 
Theorems 3^ and 3^ will be proven in Sections [5] and |6] respectively. 



4. The main tools for proving Theorem 13.21 

We need to study the concentration of Yli ^i^i in a small ball, where x = (xi, . . . , x„) is 
sampled uniformly from the set S of all (—1,1) vectors of row-sum either s — 1 or s + 1. As 
customary, we first study a similar problem for x', a random vector whose components are 
i.i.d. copy of the Bernoulli variable x defined in M. 

Let V = {vi, . . . , Vn] be a multiset in R , where d is a fixed integer. For /3 > 0, we define 
the svfiall hall probability as 



P/3{V) ■= supPx'(viXi H hf„x'„ e B{v,/3)), 

where by B{v,fi) we denote the closed disk of radius fi centered at v in R'^. 

A well-known result of Erdos [6] and Littlewood-Offord |16] asserts that \ivi are real numbers 
of magnitude \vi\ > /3, then 



p^(F) = 0(n-V2). 

This remarkable inequality has generated an impressive way of research. We refer the 
reader to |121 [T5\ [T9l [28] and the references therein for further discussion regarding these 
developments. 

In the reverse direction, we would like to find the underlying reason as to why the small 
ball probability is large (say, polynomial in n). 

Typical examples of V, where pjj is large, involve generalized arithmetic progressions intro- 
duced in the previous section. 

Example 4.1. Let Q = {X][=i f^i9i\ ~ Ki < ki < Ki} be a proper symmetric GAP of 
rank r = 0(1) and size N = n '^•' in R . Assume that for each Vi there exists Qi ^ Q 
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such that \\vi — q\\ < 6. Then, because the random sum X^j QiX^ takes value in the GAP 
nQ := {X][=i f^i9i\ ~ "^Ki < ki < riKi}, and because \nQ\ < n^N = n^^' , the pigeon-hole 
principle implies that ^^ qixi takes some value in nQ with probability n^^^'. Thus we have 



Pn5(T^)=n-«W. (5) 

The above example shows that if Vi are close to a GAP of rank 0(1) and size n*^*-^) in 
R*^, then V has large small ball probability. It was shown by Tao and the second author 
in [301 EU ESI El] , and by the current authors in ^19j that these are essentially the only 
examples of large small ball probability. We present here a somewhat optimal version. 

We say that a vector v is 6-close to a vector q if \\v — q\\ < 6. We say that v is (5-close to a 
set Q if there exists q G Q such that v is (5-close to q. 

Theorem 4.2 (Continuous Inverse Littlewood-Offord theorem for Bernoulli distribution). 
|191 Theorem 2.9] Let < a < 1/2; < C be constants. Let /3 > be a parameter 
that may depend on n. Suppose that V = {vi, . . . ,Vn} is a multi-subset of IV^ such that 
Y17=i ll^«lP ~ -'- ^'^^ ^^'^^ ^ ^'^^ large small ball probability 



P := Pp{V) > n-^, 

where in the definition of pp we assume x'i,...,x'^ to be i.i.d. copies of the Bernoulli 
random variable x defined in M . Then for any number n" < n' < n, there exists a proper 
symmetric GAP Q = {X][=i ^ifii • \ki\ < LCi} such that the following holds. 



(Full dimension) There exists a/j,^^ <C A; ^ vrj/ such that the dilate P := (P/k) ^ ■ 
Q contains the discrete hypercube {0, l}"^. Furthermore P is an integral set, P C Z*^. 

(Approximation) At least n — n' elements ofV (counting multiplicity) are 0{^)-close 
to Q. 

(Small rank and cardinality) Q has constant rank d < r = 0(1), and small cardi- 
nality 



IQI = max (l, O„,rf,c(/0"'n'(-'+'^)/2)y 



• (Small generators) There is a non-zero integer p = 0{y/n') such that all steps 
gi of Q have the form Qi = {ga, . . .,gid), where gij = /3 • ^ with pij £ Z and 

Pij = 0{f3-^V^). 

We note that [19| Theorem 2.9] was originally stated for more general distribution of the 
x'-. Another slight difference is that we require P to be a subset of Z here. However, this 
additional fact is not new as it has been explicitly verified in the proof of Theorem 2.9 (see 
the last part of [191 Section 6]). 
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Remark 4.3. As noticed in [19, Corollary 2.10], the above theorem implies that if we use 
a coarser structure (wh ich 0(/3)-approximates the Vi rather than 0(/3/A:)-approximates as 
stated in Theorem 



4.2), then we can obtain a bound of at most max(0(p ^/vr7),l) in 



the size of Q. As it turned out, the saving factor l/Vn' here plays a crucial role in any 



applications of Theorem 4.2 in the literature. 



From now on we will be mainly working with R^ (equivalently, C). Our method natur ally 
extends to R*^ for any fixed d but we do not attempt to do so here. To prove Theorem 3.2 



we need to modify our notion of concentration probability as follows. Let V = {vi, . . . , f„} 
be a multiset in R^. For any /3 > 0, we define 



p*s{V) := sup Px(t'lXi H \-VnXn G B{v,f3)), 

where the probability is taken uniformly over all (—1, 1) vectors x = (xi, . . . ,Xn) of given 
entry sum s, where |s| < (1 — e)n. (In later application we will set s to be either s — 1 or 

s + l.) 

By definition, p* is invariant under translation. One observes that for any (3 and V we have 

p^{v) = n{p}{v)/V^). (6) 



This relation suggests that if p* := p*o(y) is large, then Theorem 4.2 (more precisely. Remark 



4.3) implies that all the Vi can be approximated by a GAP Q of size 0{{p*)^\/n/\'n'). 
This bound, unfortunately, falls short for any application as the saving factor ^Jnl\fn' here 
is greater than 1 (we refer the reader to Remark 6.5 of Section [6] for more explanation). 



The above discussion shows that a sole application of (|6]) is not enough to obtain a useful 
inverse result regarding p* . In the following result, by using the extra translation invariance 
property of p*, we provide a more economical inverse result. 

Theorem 4.4 (Inverse Littlewood-Offord result with respect to p*). Suppose that V = 
{vi, . . . ,Vn} is a multi-subset ofR? such that J2i=i lb* IP = ^ '^''^d that 



P* ■■= pUV) > n 



-c 



for some fi = 0{n "^ ^^). Then for any number n'^ < n' < n there exists a proper GAP 
Q* = {go + Z][=i ^i9i '■ \h\ < Ki] such that 

• At least n — n' elements of V are (5 n^^'^^ -close to Q* . 

• Q* has small rank r = 0(1), and small cardinality 



\Q\ = max (l, 0,,c((p*)~' V^/n')) • 
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There is a non-zero integer p = 0{vn') such that all steps gi = {gii,gi2),0 <i<r 
of Q* have the form gij = /? • ^ with pij S Z and pij = 0{l5^^vri'). 



Note that the approximation in this case is not as fine as in Theorem 4.2 'or as in Remark 



4.3) and the structm'e Q* is not necessarily symmetric. On the other hand, the size of 
Q* is bounded by 0{{p*)~'^y/n/n'), which is considerably smaller than 0{{p*)~^ ^/n / yrij) 
obtained by ([6]). 



Before proving Theorem 4.4 , let us provide a useful fact whose proof is simple and hence 
omitted. 

Fact 4.5. Assume that P = {kigi + • • • + kr-gr\ — Ki < ki < Ki} is a proper synnmetric 
GAP which contains wi, . . . , Wr, where each Wi can he written as kngi + • • • + ki^gr-, kij £ 

^ 1 I l^ij I S -K j . 

(i) Assume that the vectors kj = (kn, . . . ,kir),l < i < r, have full rank in R''. Then 
we can express each generator gi as gi = ynwi + ■ ■ ■ + VirWr, where yij are rational 
numbers of the form p/q with |p|, \q\ = Or{\PY)- 

(a) Assume that k,. belongs to the space spanned by ki, . . . ,kr_i, then we can write k^ 
as kj. = yiki + • • • + yr-i^r-i, where yi are rational numbers of the form p/q with 
\p\,\q\=Ori\Pn- 

We now proceed to justify the main result of this section. 



Proof, (of Theorem 4.4) Define a new set U C R as 

U = {ui,...,Un} ■■= It,- (^1>^)i---> o • (^n,^=)|- 

By definition, we have ^^ ||iii|P = 1 and p*a{y) = P*qi2^)- Thus, by ([6]) 



Pf,l2{U) = n{p}/2{U)/Vn) = ^{p}{V)/Vn) = 0(n-^-i/2). 



We apply Theorem 4.2 to U to obtain two GAPs Q and P = (/3/2) ^k • Q respectively. 



First, observe that if the rank r of Q (and P) is at least 5, then 

\Q\ = 0((/,*)-iV^/n'('-3)/2) = 0{{p*)-'V^/n'), 

and so we are done by letting Q* be the GAP generated by the first two coordinates of the 
generators of Q. Note that go = because Q is homogeneous. Also, we obtained a very 
good approximation (of order 0{f3/k)) in this case. 

Next we observe that r cannot be 3. Assume otherwise that P = {X^j=i ^j9i : l^il < 
Ki}, where gi = {gn, gi2, gn) £ Z'^ are the generators of P. Because P C Z^ and it 
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contains (1,0,0), (0, 1,0) and (0,0, 1), by Fact|4.5| (i) the generators gi must have the form 



igii,9i2,gi3) where \gij\ are bounded by 0{\P\%But P has size 0{ppj^(U)) = 0(n'^+^/2), 

thus \gij\ = 0(n^'-^"*"^'^). As a consequence, all of the elements of P must have norm at most 
0{n ~^'^). However, this is impossible because as one of the elements of P is 0(l)-close to 
an element of {/3/2k)~^ • U, its second coordinate must be of order at least ^^ , which is 
greater than n*^*-^+^ by the assumption of /3 of being sufficiently small. 

We now consider the case r = i, P = {X]i=i ^i9i ■ l^il < ^i}, where gi = {gii,gi2,giz) G Z^. 
Let (wi, 0) • • • ) (wn-n'J) be the elements of P which are 0(l)-close to n — n' elements of 
the dilated set {/3/2k)~^ ■ U. Apparently / = Q{/3~^k/^/n). We next consider two cases. 



Case 1. If all \\wi\\ are smaller that n ~^'^, then we would be done because in this case the 
order of all ||uj|| is at most 0((/3/2A;)n^*-^"'"^), which is bounded by /3n'^*-^+^. 

Case 2. Assume otherwise that, say ||wi|| > n ~^'^. Consider the following elements of 
P, bi := (l,0,0),b2 := (0, l,0),b3 := (0,0,1) and b4 := (wi,l). Because \\wi\\ is greater 



than n^'-^"'"^, one checks that the condition of Fact 4.5 (ii) does not hold for bi,b2,b3 and 



b4. We thus apply Fact 4.5 (i) to conclude that each gi can be expressed as in the form 



Cjibi + Ci2h2 + Cjsbs + Ci4b4, where Cij = p/q and \p\, \q\ = 0{n'^'~^+'^) 

Next, consider any b = {wiQ, I) from the set {{wi,l), . . . , {wn-n'i 01- There exist fci, /c2, fes, A;4 G 
Z, \ki\ < Ki, such that b = kigi + ^252 + ^353 + ^454; and so 



b = (felCii + fe2C21 + fesCsi + /C4C4i)bi + {kiCi2 + X2k22 + X3C32 + k4C42)^2 
+ {kiCi3 + A;2C23 + ksCss + A;4C43)b3 + (kiCu + A;2C24 + k3C34 + A;4C44)b4. 

Notice that I = Q{j3^^k/ y/n) > n'^^^'^^^, meanwhile \k1C13 + /C2C23 + k^css + /C4C43I = 
Q^^5C+5/2-^ and \kici4 + k2C24 + A:3C34 + /C4C44I = &{n~^^'-^~^) as Cij are rational numbers 
whose denominators are bounded by 0(n^'^"'"*^) and kicu + A;2C24 + ^3034 + /C4C44 cannot be 
zero. We conclude that the coefficients of b3 and b4 must be and 1 respectively. 

It thus follows that, by considering the first two coordinates of bi and b2, 

\\Wio - l^lll^ = ((^iCll + A;2C21 + A;3C3i + A;4C4i)^ + {kiCu + k2C22 + fc3C32 + fc4C42)^) 

= 0(n5^+5/2) < n^c+3^ 

Combining Case 1 and 2, we infer that if r = 4 then all but n' elements of V are Pn^'-'''^^- 
close to a common point. To complete the proof, we just simply set go = f3n^'^~^^ ■ p be 
this approximated point where p is a complex number of integral coordinates and \p\ < 
p-i^-5C-3_ We set other generators to be zero. 

D 
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We now deduce an important corollary of Theorem 4.4 which, similarly to the result of 
Erdos and Littlewood-Offord, states that as long as the multi-set V is not too degenerated 
(for a given /3), its concentration probability p* must be small. 

Corollary 4.6. Let < a < 1/2 be a positive constant and let n' be a number satisfying 
^i/2+a < 72' < n. Assume that j3 < n~^^ and V is a multi-set in R^ so that any of its n — n' 
elements cannot be I3n^ -close to a common point. Then we have 



pliV) = OiV^/n'). 



Proof, (of Corollary 4.6) Assume otherwise that p*a{V) > Cy/n/n' for some large constant 
C to be chosen. So 



p*iV)>Cn-^/\ 



We next apply Theorem 4.4 to V to obtain a GAP Q* which is Pn^^''^ to all but n — n' 



elements of V. Notice that because there are no more than n — n — 1 elements of V that 
are /3n^-close to one common point, Q* must have size at least 2. On the other hand, from 



the conclusion of Theorem 4.4, assuming that C is sufficiently large depending on a, the 
size of Q* is bounded by 



\Q*\ = msix{l,Oo,{ip*)-^Vn/n) = max(l, 0«(-^)) = 1. 

This contradiction completes the proof of our corollary. 

D 



5. Proof of Theorem 13.21 



We will invoke Theorem 4.4 Define a radius sequence (/3fc)§° where /3o := n and 



Pi+1 — n 'Pi. 



Let V be the multi-set of ui, . . . , Vn- Then the assumption of Theorem 3.2 becomes 

Pp:{V)>n~''. 

with either s = s — loxs = s + \. 

Next, because the increasing sequence p*a.{y) is bounded from above by 1, by pigeonhole 
principle there exists < /cq < 2B /a such that 
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/'L..(^)<-"/VL(^)- 



As A was chosen to be sufficiently large, one has P^o — ''^ • We next apply Theorem 4.4 
to V with n' = n^/^+a ^nd /3 = /3fc(, to obtain a GAP Q* = {go + Yll=i hgi, l^il < Ki} for 
which the following holds. 

• Q* has small rank r = 0(1), and small cardinality 

|Q*|=max(l,0,,B((p;^^(F))-Vn")). 

• There are no := n — n-'^''^"'"" elements Vj^, . . . , f j^ of ^ which are 0(/3fcQn^^+^)-close 
to n — n^/'^+'^ elements ui, . . . , ti„o of Q*. 



• There is a non-zero integer p = 0(vn^/^+") such that all steps gi = {gii,gi2),0 < 
i < r oi Q* have the form gij = j5^ Pij/p with pij E Z and Py = 0(/3^ vni/2+"). 
In particular, all the components of the elements of Q* have the form p/q where 

I I I I ^ 4-4-1 

IpI, I^I < n ^ . 

Next, for each v of the remaining 71^/^+" exceptional elements of V (which are not close to 
any element of Q*), we trivially approximate it by a complex number v whose components 
are rational numbers of the form p/q with \q\ < n ~^^ such that |it — uj < /Sfc^n^ ^^. 

By the approximation we infer that 

||u-v|| = (j;K-^,|Y/2</3,„n5^+V2. 

i 

Taking into account that \fij\ < n"^, we thus have 

where U is the multi-set {ui, . . . , Un}- 

From the estimate above, as p*p {V) < n°'''^p*a (V), it is implied that 

pl^^.B^.^AU)<n'^/'pX^{V). 
So the size of Q* is bounded by 



|Q*| = max (l,0((p;^^„5s+4+.(C/))-Vn"/'))- 
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In summary, we have obtained a vector u = (ui, . . . , u„) which satisfies the fohowing prop- 
erties. 

• ||u|| X 1, and because (v,rj(X)) = for any row rj of X of index i > 2, we also 

haveKu,ri(X))|</3fcon5^+4+^. 

• There exists a generahzed arithmetic progression Q* of rank OB,a{^) and size |Q*| = 
max(l,0((p* ,^5s+4+^(f/))~"^/n"'^) j that contains at least n — n^/^+" complex 
numbers Ui. 

• All the components of Ui and of the generators of Q* are rational numbers of the 
form p/q, where \p\, \q\ < n ^'^. 



This completes the proof of Theorem 3.2 



6. Proof of Theorem 13.31 



By applying Theorem 3.2, we obtain a structural vector u which satisfies all the described 
properties. Because the number of /3 is bounded by a constant, it is enough to verify 
Theorem |3.3| for one such (3. By paying a factor of n in probability, we assume that 



|(u,ri(X))| < /Sn^^+^+T for the last n - 1 rows of X. 

Set /3' := firt'^^^'^'^ . We will consider two cases depending on the structure of u. 

6.1. Degenerate u. We first consider the probability Pmajor of the event |(rj, u)| < /3', 2 < 
i < n, for which there are no := n — ri^/'^~^°' complex numbers Uj which can be /3'n^- 
approximated by a common point u'q E /3'n^ • Z'^. 

By paying a factor (^ ) in probability, we may assume that this point approximates the first 
no complex numbers ui, . . . , Uno- Thus, by approximating the remaining Ui by u[ G 13' n^ ■ T? 
such that \ui — n^| < /3'n^, the events |(rj,u)| < /?' belongs to the event |(rj,u')| < /3'n^, 
where u' = (n'^, . . . ,ii']^, u^^^;^, . . . ,?x^) and ||u'|| x 1. 

Let -'^(n-i)xn be the matrix generated by the last n — 1 rows of X, and let X' be the n — 1 
by n — no matrix obtained from X(ri-i)xn by joining its first ng columns, 

X = \ci{Xf^n-l)y.n) + • • • + C„o (X(„_i)xn)) C„o+l(X(-„_i)xn)) • • • ^ ^n{X (^ri~l)xn)\ ■ 

By definition, the row vectors of X' satisfy |(rj(X'), u[j.)| < /3'n^ where vl[^ := {u'^,u'^f^^-^^, . . . , u'„). 
It also follows from definition that the i-th row of X' has the form ri{X') = x'+f , where f = 

\Jil~r ' ' ' ~r Jinoj /i(no+l)i • • • ' Jin) a-no. X ^ (Xi + • • • -\-XnQ, 3;^p_|_i, . . . , Xn) -^ [Xi, ■ ■ ■ , ^n—no)' 

As X is sampled uniformly from S, the set of all ( — 1,1) vectors of entry-sum either s — 1 
or s + 1, x' is a random vector chosen from type 1 or type 2 defined below. 
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Type 1. (row-sum s + 1) 



( no \( n-no ■] 

-p/ / _ ; \ _ y{no+k)/2j \{n-no+s+l-k)/2) 
f[Xi - fcj - - - :—— - 



U/2+(s-l)/2J + U/2+(s+l)/2J 

for all k such that k + no is even; and (x2, . . . , x'^-no) ^^^ chosen uniformly from all ( — 1, 1) 
vectors of row-sum s + 1 — x'^. 

Type 2. (row-sum s — 1) 

P(x'i = k) -- 



( no \( n-no \ 

\(no+k)/2) \(n-no+s-l-k)/2) 

Vn/2+(s-l)/2J + Vn/2+(s+l)/2J 



for all k such that k + riQ is even; and (xg, . . . , x'j^-no) ^^^ chosen uniformly from all (—1, 1) 
vectors of row-sum s — 1 — x'l- 



It is clear that 



■r,/ / , -,\ U/2+(s+l)/2J 

P(x' G type 1) - / ^y -r J/ 



\n/2+(s-l)/2J + Vn/2+(s+l)/2J 

and 



T-./ / i r,N U/2+(s-l)/2J 

P(x G type 2) - ' ^ " 



U/2+(s-l)/2/ + U/2+(s+l)/2J 

Observe that as |s| < (1 — e)n, these two probabilities are comparable, each of which can 
be bounded crudely from below by (1 — e)/4. 

We next apply the following result. 

Claim 6.2. Let e < 1/4 be a fixed constant. Let u^j. = {u'l,u'^^_^_l, . . . ,u'„) be a vector in 
which the components of each complex u'^ is of the form /3'n^ • Z and such that nolti'xP + 
l^'no+il^ + • • • + l^i'raP >i 1- Then, as n is sufficiently large and f is a fixed vector, one has 



F^KKx + f ,<)| < /3V) < 1 - (1 - e)/8. 



Proof, (of Claim 6.2) We will consider two main cases below. 



(i) We first assume that there exists 1 < io < Jo such that \u[^^ — u'-\ > (3'n^. Without 
loss of generality, assume that iq = n — 1 and jo = n. It follows from the distribution of 
x' that the event of having exactly one —1 among the last two components of x' happens 
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with probability at least (1 — e)/4 asymptotically. Within this event, observe that for any 
tuple (x'l, . . . , x;_„^_2), either x = {x[, . . . , x'^_^^_2, -1, 1) or x = (x'l, . . . , x'^_^^_2, 1, -1) 
does not satisfy |(x', u[j.) + (f , u[^)\ < /3'n^. Thus we have 



P.KI(x',uU + (f ,u;,)| < /3V) <!-(!- 6)/8. 

(ii) Assume otherwise that there exists u' such that all \u' —u'^^^i\, . . . , \u' — u'^l are bounded 
by f3'n^. In this case, the inequality |(x',u(-j,) + (f',u(.j,)| < /3'n^ implies that 



\x'M - n') + n'{x[ + x'2 + • • • + <_„„) + (f , Ol < P'n^. (7) 

We next consider the subcase \ui — u'\ > (3'n^ . If x'^ + X2 + • • • + x'j^_n^ = s + 1, then ([T]) 
implies that x'l belongs to the interval [(— n'(s + 1) — (f, u(.j,))(/3'n^)~^ — l/n?,{—u'{s + 
1) — (f , u(.j.))(/3'n®)~^ + 1/n^] . However, because this interval has length 2/n^, and so this 
probability is clearly bounded by supjr. P(x']^ = k), which is clearly smaller than 1 — (1 — e)/4. 
We argue similarly for the case rc'^ + X2 + • • • + x'^^-no = s — 1. 

For the remaining subcase \u'i — u'\ < /3'n*, as A was chosen to be large enough, we have 
m— "w'l < n~^. Next, because ||u(.j,|p = no|nQp+(n— no)|u'p x 1, we infer that \u'\ x l/y/n. 
It then follows that 



|n'(x; + --- + xUj + (f',u;)l</3V. (8) 

However, as x'^ + • • • + x^.^^^ takes value s + \ and s — 1 each with probability at least 
(1 — e)/4, the equation Q above holds with probability at most 1 — (1 — e)/4. 

D 

Now we estimate Pmajor- As the event |(rj(X),u')| < /3'n^ is controlled by |(rj(X'), u[j.)| < 
/3'n^, and By Claim 6.2 the later holds with probability (7 + e)/8, it follows that the 



probability that \{Yi{X), u')| < /J'n^ for aU 2 < i < n is bounded by ((7 + e)/8)"-^ 

Additionally, an elementary computation implies that the number of structural vectors 
u' e (/3'n^ • z2)"-'^o+i satisfying ||u'|| x 1 is bounded by 



Putting together, we obtain the following bound for Pmajor 



no J \n — no — 1 
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Remark 6.3. In the treatment above the fact that x' takes either type 1 or type 2 with 
comparable probabihty is crucial. The assumption of just one type would not be enough to 
estimate Pmajor unless we had an additional assumption on u', say u'i + ■ ■ ■ + u'^ is nearly 
zero. 



6.4. Non-degenerate u. We consider the probability Pminor of the event that there exists 
a vector u for which |(rj(X), u)| < /3' ,2 < i and the following holds 

• ||u|| X 1 and there does not exist any u which is /3'n^-close to all but n^/'^+°' complex 



numbers Ui. Thus it follows from Corollary 4.6 that 



• There exists a generalized arithmetic progression Q* of rank OB,a{^) and size |Q*| = 
max(l,0(p^,([/)~V^"''^)) = 0(/9^,(C/)-Vn"/^) that contains at least n - 71^/2+" 
complex numbers Ui. (Here we used the estimate p*a,{U)~^ = Q(n") to eliminate 
the trivial constant 1 in the size estimate oi Q*.) 

• All the components of Ui and of the generators of the generalized arithmetic pro- 
gression are rational numbers of the form p/q, where \p\, \q\ < n ~^^. 

Let < 5 to be chosen (any 6 < a/3 will suffice) . We divide the interval [n^ , Oa{n~°'''^)] 
into sub-intervals [n~^ '^^> ,n^ ], where a/26 < k < B/5. For each k, let G^ be the 
collection of u's such that p*o,{U) G [n^^^^' ,n~ ], and let Pfc be the probability that 
\{ri{X'), u)| < f3' for all i and for one of u from G^. 

We now bound the size of G^. To do this, we first count the number of GAPs which may 
contain most of the Ui of vectors u from Gfc, and then count the number of u's whose Uj are 
chosen from the determined structure. Recall that all components of the GAP generators 
are of the form p/q, where |p|, |g| < n ~^^. Because each GAP has rank 0_b,q,(1) and size 
0{{p*)-^/n'^/'^) = 0(n'5('=+i)/n"/2)^ the number of such GAPs is bounded by 



^^4A+4^0s,c.(i)(„<5(fc+i)/^a/2)Os,„(i) ^ o(n'^s-">*(^)). 

After choosing a Q* of size 0{n^^~^^' /n"'"^), the number of ways to choose n — n^'^+" 
complex numbers Ui as (5*'s elements is 

For the remaining n^""*"" exceptional elements, there are {n ~^^)'^ ° = 0{n ^^"^ "') 
ways to choose them. Putting these bounds together, we obtain the following bound for the 
number of u of G^ 
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IGfcl = 0(nOA,B..,.(n^/^+'^)(„<5(fc+l)/^a/2)n-„V2+.^_ 



Now, for a given u G G^, the probability that |(rj(X), u)| < /?' for all 2 < i < n is bounded 



by (P0'(u))" ^ < {n ''^)" ^. Thus we can estimate P^ as 



Pfc < \Gk\{n-^''T'^ = O ('n°A-^---^("'^'^")(n'^)"/(n"/2)'^-"'/'+°') = o{n-'''''^) 



provided that 5 was chosen to be smaller than a/3. 
Summing over k, we thus obtain 



^minor — / ^ "k — 0(71 



-an/6\ 
k<B/S 

Remark 6.5. One observes that the saving factor l/n"^' ^ in the size of Q* plays a key role 



in our analysis here. This explains the necessity of Theorem 4.4 



7. Concentration of distance 



We now give a proof of Lemma 2.5 basing on [25]. Let P = {pij) be the n by n orthogonal 
projection matrix from C" to V-^. Thus P is Hermitian and P^ = P. We first normalize x'^ 
by setting y[ := x[ — s/n and fl := fi + s/n for 1 < i < n. We then have Ey^ = 0, Var(2/^) = 
1 - (s/n)^ and 



d'' = i|p(f +x')f = \\p{i'+y')f = Ylp^M + f^M + r,) 

= ^Pijy'iVj + ^yiiPijfj+Pjifj) + ^Pijf'ifj 

ij ij ij 

:= {n-k) + 4 + Y. 
It is clear that Ey = 0, thus 

B{d^) = {n-k) + dl. 
Note that 
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= E| Y.^^,y[yf + E| j; y\(j>,J'^ + p,,/j) 



(1 - {sinf)\j^\M^ + j; I Y.v^r, + E^^-^/ii 



<Ei^^.i'+4EiE^.^/ii' 

= E \p^^\' + 4 E^^u.44 = E^l + 44 

«7^i iii2 «^j 

Next, because YliPa = {n — k), by Cauchy-Schwarz inequality 

Vp?i > {n-kf/n. 



Thus 



E l^*il^ ~ E l^*il^ ~ Z_^Pii ^ {n — k) — (n — k)'^/n < min(A;, n — k). 
It is implied that 

Ey^ < min(fc, n - k) + Adj,. 



Consider the event d > \ n — k + d"^, +3. The probability of this event is bounded from 
above by 



P{d'^ >n-k + dl + QJn -k + dj,) 



= P{Y>6^Jn-k + dl) 
<P(y2 > 36(n-/c + 4)) 

Ey2 1 

- 36{n-k + dl) " 9' 
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Similarly, consider the event d! < \ n — k + d'^, — ?>. The probability of this event is bounded 
from above by 



P(d'2 <n-k + dl - &Jn -k + d1, + 9) 



= P{Y < -6\Jn -k + d1,+9) 

< P(y2 > 36(n -k + 4) - 108 Jn - k + dj, + 8l) 



Ey2 1 

- ^ - ~ 

m{n-k + dj,)-108Jn-k + dl + 81 ^ 



provided that k < n — 10. 

Thus the median M of d' satisfies \M — \ n — k + d'^i\ < 3. 



Since the distance function is convex on { — 1, 1}"" with Lipschitz constant 1. Talagrand's 
concentration inequality pS^ implies that for any t 



V{\d' -M\>t)< 4exp(-tVl6). 
follows. 



Since \M — \ n — k + d^i\ < 3, Lemma 



2.5 
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